13 Multiple Variables

#PolarCoordinate #Orthogonal #Transformation #GammaDistribution #BetaDistribution

1 Joint Distribution

Now we want to explore multiple variables and their interaction. Here we focus on bivariate jointly continuous RVs.

Jointly Continuous

$(X, Y)$ are jointly continuous if $\exists$ a function with joint density function $f (x, y)$ , s.t. $P ((X, Y) \in B) = \iint_{B} f (x, y) d x d y,$
For $B \subset R^{2}$ measurable.
So $f_{X, Y} (x, y) = lim_{Δ \to (x, y)} \frac{P ((x, y) \in Δ)}{Area (Δ)} .$

A joint density function satisfies

$\iint_{R^{2}} f (x, y) d x d y = 1$ ;
$f (x, y) \geq 0$ .

Example: Uniform

Let $D \subset R^{2}$ , we can choose a point $(X, Y)$ uniformly in $D$ : $f_{X, Y} (x, y) = \frac{1_{D} (x, y)}{Area (D)} .$

Example: Two independent standard normal distribution

$f_{X, Y} (x, y) = \frac{1}{2 π} e^{- \frac{x^{2} + y^{2}}{2}} .$

In the one dimensional case, for continuous RV, we have the approximation $P (X \in [x, x + ε)) \approx f_{X} (x) ε .$
Similarly in two dimentional case, take a small neighborhood $Δ$ containing $(x, y)$ , then we have $P ((X, Y) \in Δ) \approx f_{X, Y} Area (Δ) .$

How to recover the marginal density $f_{X}$ or $f_{Y}$ given $f_{X, Y}$ ?

P (X \leq t) = \int_{- \infty}^{t} \int_{- \infty}^{\infty} f_{X, Y} (x, y) d y d x = \int_{- \infty}^{t} f_{X} (x) d x .

Fact

$f_{X} (x) = \int_{R} f_{X, Y} (x, y) d y = \int_{R} f_{X, Y} (x, y) d x .$

Proof

We can only prove $f_{X}$ . Note that $P (X \leq t) = P (X \leq t, - \infty < Y < \infty) = \int_{- \infty}^{t} \int_{- \infty}^{+ \infty} f_{X, Y} (x, y) d y,$ differentiate on both sides.

$X \overset{d}{=} W, Y \overset{d}{=} Z$ will not lead to $(X, Y) \overset{d}{=} (W, Z)$ : let $X, W, V \overset{i . i . d}{\sim} Exp (1)$ , and $Z \sim Gamma (2, 1)$ . Let $Y = X + V$ . By this result we know $Y \overset{d}{\sim} Gamma (2, 1)$ . Then $X \overset{d}{=} W, Y \overset{d}{=} Z$ . However, we always have $X \leq Y$ , and $W > Z$ has positive probability, so $(X, Y) \overset{d}{\neq} (W, Z)$ .

1.1 Independence

$X, Y$ are independent if $f_{X, Y} (x, y) = f_{X} (x) f_{Y} (y), \forall (x, y) \in R^{2} .$
By independence, the value of $X$ will not give us anything about $Y$ .

Example

$f_{X, Y} (x, y) = 1 {0 \leq x \leq a} 1 {0 \leq y \leq b}$ . Calculate the marginal distribution:
$f_{X} (x) = 1 {0 \leq x \leq a}, f_{Y} (y) = 1 {0 \leq y \leq b}$ . So $X ⊥ ⊥ Y$ .
For $f_{X, Y} = \frac{1}{π r^{2}} 1 {x^{2} + y^{2} \leq r^{2}}$ , $f_{X} (x) = \int_{- \sqrt{r^{2} - x^{2}}}^{\sqrt{r^{2} - x^{2}}} f_{X, Y} (x, y) d y = \frac{2}{π r^{2}} \sqrt{r^{2} - x^{2}} 1 {- r \leq x \leq r} .$ $f_{Y} (y)$ is similar. Then $X, Y$ are not independent.

2 Bivariate Transformation

Transformation of random variables: $T : R^{2} \to R^{2}, (X, Y) \mapsto (W, Z)$ .
Polar coordinates: $(x, y) \mapsto (r, θ), x = r \cos θ, y = r \sin θ$ .

Fact (Polar Coordinates)

$f_{R, Θ} (r, θ) = r f_{X, Y} (r \cos θ, r \sin θ) .$

Proof

On one hand, $P (r \leq R \leq r + δ, θ \leq Θ \leq θ + ε) \approx f_{R, Θ} (r, θ) δ ε .$
On the other, by describing the event using $(X, Y)$ we have $\begin{aligned} P (r \leq R \leq r + δ, θ \leq Θ \leq θ + ε) = P ((X, Y) \in Δ) \\ \approx & f_{X, Y} (x, y) Area (Δ) = f_{X, Y} (r \cos θ, r \sin θ) r δ ε . \end{aligned}$
Putting together we conclude.

Linear transformation

$T : R^{2} \to R^{2}$ is a linear transformation if $T (\binom{x}{y}) = M_{T} (\binom{x}{y}) + P_{T} . M_{T} \in R^{2 \times 2}, P_{T} \in R^{2 \times 1} .$

Some properties:

$T$ is invertible if and only if $M_{T}$ is invertible;

$P$ is a parallelogram, then $T (P)$ is also one;

$Area (T (P)) = Area (P) \cdot \det (M_{T})$ .

Let $T$ be a linear transformation with inverse $S$ . Given the joint p.d.f of $(X, Y)$ , what's the joint p.d.f of $(W, Z) = T (X, Y)$ ?
On one hand, $P ((W, Z) \in P) \approx f_{W, Z} (w, z) Area (P) .$
Similarly $\begin{aligned} P ((W, Z) \in P) & = P (T (X, Y) \in P) = P ((X, Y) \in S (P)) \\ \approx f_{X, Y} (x, y) Area (S (P)) = f_{X, Y} (S (w, z)) Area (P) | det (M_{S}) | . \end{aligned}$
We conclude that for invertible transformation $T$ ,

f_{W, Z} (w, z) = f_{X, Y} (S (w, z)) | \det (M_{S}) | .

Rotations

In positive direction, $T_{θ} = (\begin{matrix} \sin θ, & \cos θ \\ \cos θ, & - \sin θ \end{matrix}), P_{T_{θ}} = (\binom{0}{0})$ . $(X, Y) \overset{T_{θ}}{\to} (X (θ), Y (θ))$ . Then $f_{X (θ), Y (θ)} (w, z) = f_{X, Y} (w \cos θ - z \sin θ, w \cos θ + z \sin θ) .$

Sum & difference

$f_{X + Y, X - Y} (w, z) = \frac{1}{2} f_{X, Y} (\frac{w - z}{2}, \frac{w + z}{2}) .$

This is the rotation of $θ = \frac{π}{4}$ .

Orthogonal transformation

$T$ is an orthogonal transformation if it preserves the inner product: $⟨ \vec{v}, \vec{w} ⟩ = ⟨ T \vec{v}, T \vec{w} ⟩$ . I.e.: $P_{T} = 0$ , $M_{T}$ is an orthogonal matrix, $M_{T}^{- 1} = M_{T}^{T}$ .

They preserve angles, lengths, Areas, $\det (M_{T}) = \pm 1$ .

Fact

For $R^{2}$ , all orthogonal transformations are rotation, reflection, and composition of the two.

3 Invertible Affine Transformation

Suppose $T : R^{2} \to R^{2}, T (X, Y) = (U, V)$ has inverse $S (U, V) = (X, Y)$ .
Pasted image 20241201150847.png|400
Define linear translation $T (X, Y) = M_{T} [\begin{matrix} X \\ Y \end{matrix}] + P_{T} = [\begin{matrix} U \\ V \end{matrix}],$ where $M_{T}$ is $2 \times 2$ invertible matrix, and $P_{T}$ is $2 \times 1$ vector.
Since $S T (X, Y) = S (U, V) = (X, Y)$ , $S (U, V) = M_{T}^{- 1} [\begin{matrix} U \\ V \end{matrix}] - M_{T}^{- 1} P_{T} = M_{S} [\begin{matrix} U \\ V \end{matrix}] + P_{S} .$
Since $\begin{aligned} P ((U, V) \in B) & = P ((X, Y) \in S (B)) \\ \Rightarrow \iint_{B} f_{U, V} (u, v) d u d v & = \iint_{S (B)} f_{X, Y} (x, y) d x d y, \end{aligned}$ we have $\begin{matrix} (3.1) & f_{U, V} (u, v) = f_{X, Y} (S (u, v)) | det M_{S} | . \end{matrix}$

Example

If $U = X + Y, V = X - Y$ , then $X = \frac{1}{2} (U + V), Y = \frac{1}{2} (U - V)$ .
Pasted image 20241201151330.png|400
Let $Δ = a$ be a small region containing $(u, v)$ . Then $\begin{aligned} P ((U, V) \in Δ) & = P ((X, Y) \in S (Δ)) \\ \Rightarrow f_{U, V} (u, v) Area (Δ) & = f_{X, Y} (\frac{u + v}{2}, \frac{u - v}{2}) Area (S (Δ)) \\ \Rightarrow f_{U, V} (u, v) & = \frac{1}{2} f_{X, Y} (\frac{u + v}{2}, \frac{u - v}{2}) . \end{aligned}$
Algebraically, $T (\begin{matrix} X \\ Y \end{matrix}) = (\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}) (\begin{matrix} X \\ Y \end{matrix}) = (\begin{matrix} U \\ V \end{matrix}),$ and $M_{T} = (\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}), M_{S} = \frac{1}{2} (\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}), M_{S} M_{T} = I_{2},$ so we also have $f_{U, V} (u, v) = \frac{1}{2} f_{X, Y} (\frac{u + v}{2}, \frac{u - v}{2}) .$

4 General Invertible Transformations

$T : R^{2} \to R^{2}$ . Assume differentiable, but not necessarily affine. Also assume $T (X, Y) = (U, V)$ , $S (U, V) = (X, Y)$ , $S T (X, Y) = (X, Y)$ . So $\begin{aligned} P ((U, V) \in Δ) & = P ((X, Y) \in S (Δ)) \\ \Rightarrow f_{U, V} (u, v) Area (Δ) & \approx f_{X, Y} (S (u, v)) Area (S (Δ)) . \end{aligned}$
We want to know $S (Δ)$ .
If $S$ is affine, then $S (Δ)$ is a parallelogram. For general $S$ , if $δ, ε ≪ 1$ , then $S (Δ)$ can be approximated by a parallelogram, since $S$ can be approximated by an affine transformation on $Δ$ .
Let $S (\begin{matrix} u \\ v \end{matrix}) = [\begin{matrix} S_{1} (u, v) \\ S_{2} (u, v) \end{matrix}] \in R^{2},$ where $S_{i} : R^{2} \to R, i = 1, 2$ are differentiable functions. Then for any point $(a, b)$ near $(u, v)$ , the Taylor expansion in second order gives $S_{i} (a, b) \approx S_{i} (u, v) + (a - u) \frac{\partial S_{i}}{\partial u} (u, v) + (b - v) \frac{\partial S_{i}}{\partial v} (u, v) .$
In matrix notation, $S [\begin{matrix} a \\ b \end{matrix}] \approx [\begin{matrix} S_{1} (u, v) \\ S_{2} (u, v) \end{matrix}] + [\begin{matrix} \frac{\partial S_{1}}{\partial u} (u, v) & \frac{\partial S_{1}}{\partial v} (u, v) \\ \frac{\partial S_{2}}{\partial u} (u, v) & \frac{\partial S_{2}}{\partial v} (u, v) \end{matrix}] [\begin{matrix} a - u \\ b - v \end{matrix}] .$ This is an affine transformation. Denote the yellow matrix as $J_{S} (u, v)$ , which is the Jacobian matrix of $S$ at $(u, v)$ .

Hence, $P ((X, Y) \in S (Δ)) \approx f_{X, Y} (S (u, v)) | det J_{S} (u, v) | Area (Δ),$ so $\begin{matrix} (4.1) & f_{U, V} (u, v) = f_{X, Y} (S (u, v)) | det J_{S} (u, v) | . \end{matrix}$

Example

Let $X \sim Gamma (α_{1}, β), Y \sim Gamma (α_{2}, β)$ , and $X ⊥ ⊥ Y$ . Let $U = X + Y, V = \frac{X}{X + Y}$ . Then $X = U V, Y = U (1 - V)$ .
So $S_{1} (u, v) = u v, S_{2} (u, v) = u (1 - v)$ , and $J_{S} (u, v) = [\begin{matrix} v & u \\ 1 - v & - u \end{matrix}]$ , $| det J_{S} (u, v) | = | u |$ .
Plug in (4.1), $\begin{aligned} f_{U, V} (u, v) = & \underset{f_{X} (u v) f_{Y} (u (1 - v))}{\underset{⏟}{f_{X, Y} (u v, u (1 - v))}} u 1 {u \in (0, + \infty)} 1 {v \in (0, 1)} \\ = & \frac{β^{α_{1}} (u v)^{α_{1} - 1} e^{- β u v}}{Γ (α_{1})} \frac{β^{α_{2}} [u (1 - v)]^{α_{2} - 1} e^{- β u (1 - v)}}{Γ (α_{2})} \\ u 1 {u \in (0, + \infty)} 1 {v \in (0, 1)} \\ = & \underset{Gamma (α_{1} + α_{2}, β)}{\underset{⏟}{\frac{β^{α_{1} + α_{2}} u^{α_{1} + α_{2} - 1} e^{- β u}}{Γ (α_{1} + α_{2})} 1 {u \in (0, + \infty)}}} \\ \underset{Beta (α_{1}, α_{2})}{\underset{⏟}{\frac{Γ (α_{1} + α_{2})}{Γ (α_{1}) Γ (α_{2})} v^{α_{1} - 1} (1 - v)^{α_{2} - 1} 1 {v \in (0, 1)}}} . \end{aligned}$
So $\begin{aligned} f_{U, V} (u, v) & = f_{U} (u) f_{V} (v) \\ \Rightarrow F_{U, V} (u, v) & = F_{U} (u) F_{V} (v), \forall (u, v) \in R^{2}, \end{aligned}$ so $U ⊥ ⊥ V$ .

In summary,

$X + Y \sim Gamma (α_{1} + α_{2}, β)$ ,

$\frac{X}{X + Y} \sim Beta (α_{1}, α_{2})$ ,

$X + Y ⊥ ⊥ \frac{X}{X + Y}$ .